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Abstract. Let M n be a random element of the unitary, special orthogonal, or unitary 
symplectic groups, distributed according to Haar measure. By a classical result of Diaconis 
and Shahshahani, for large matrix size n, the vector (Tr(M n ), Tr(M^), . . . , Tr(M*)) tends 
to a vector of independent (real or complex) Gaussian random variables. Recently, Jason 
Fulman has demonstrated that for a single power j (which may grow with n), a speed of 
convergence result may be obtained via Stein's method of exchangeable pairs. In this note, 
we extend Fulman's result to the multivariate central limit theorem for the full vector of 
traces of powers. 



One aspect of random matrix theory concerns random elements of compact Lie groups. A 
classical result, due to Diaconis and Shahshahani [DS94J, is as follows: Let M n be an element 
of U n , O n , or USp 2n , distributed according to Haar measure. Then, as n — > oo, the vector 

(Tr(Af n ),Tr(M2),...,Tr(M n rf )) 

converges weakly to a vector of independent, (real or complex) Gaussian random variables. 
The original proof deduced this from exact moment formulae, valid for n sufficiently large 
(see also |Sto05j ). Different versions of the moment computations, also taking care of SO n , 
have been proposed in [PV04j and [H R03] . 

Subsequently, the speed of convergence in the univariate version of this result was studied 
by Charles Stein |Ste95j . who proved that in the orthogonal case the error decreases faster 
than any power of the dimension, and Kurt Johansson |Joh97| . who obtained exponential 
convergence. While Johansson's approach had its roots in Szego's limit theorem for Toeplitz 
determinants, Stein used the "exchangeable pairs" version of a set of techniques that he 
had been developing since the early 1970s (see |Ste72j ) and that nowadays is referred to as 
"Stein's method". Recently, Jason Fulman |FullO] has proposed an approach to the speed 
of convergence for a single power of M n , based on combining Stein's method of exchange- 
able pairs with heat kernel techniques. While producing weaker results on the speed than 
Stein's and Johansson's, his theorems apply to the case that the power grows with n. 

Furthermore, his techniques seem more likely to be useful in contexts beyond the standard 
representations of the classical groups. 

In this note, we extend Fulman's approach and results to a multivariate setting, making use 
of recent extensions, due to Chatterjee, Meckes, Reinert, and Rollin C'MOSl [RR09|, [Me c09j. 
of Stein's method of exchangeable pairs to cover multivariate normal approximations. This 
yields, to the best of our knowledge, the first rates of convergence result in the multivariate 
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CLT for traces of powers on the classical compact groups, or, in any case, the first one that 
allows for the powers to grow with n. Our set-up contains Fulman's as a special case, and we 
recover, in Wasserstein distance, the rates that were proven by him in Kolmogorov distance 
(see Remark [3T21 below) . 

We will review Meckes' version of the multivariate exchangeable pairs method ([Mec09j) in 
Section [TJ stating a complex version of her results that will be useful for the unitary case. 
This will lead to a slight improvement on Fulman's result even in the one dimensional (single 
power) case, in that now the convergence of the (complex) trace of a power of a random Haar 
unitary to a complex normal distribution can be quantified. 

Fulman's approach consists in constructing useful exchangeable pairs from Brownian motion 
on the connected compact Lie groups K n = U n , SO„, or USp 2n . This amounts to studying 
the heat kernel on K n , and in particular the action of the Laplacian on power sum symmetric 
polynomials, which express products of traces of powers of group elements in terms of their 
eigenvalues. This is reviewed in Section |2j Explicit formulae are due to Rains |Rai97] . They 
are rooted in Schur-Weyl duality as explained in |Lev08] . 

In Sections [31 IH and0 we then state and prove rates of convergence results in the multivariate 
CLT for the unitary, special orthogonal, and unitary symplectic groups. 

1. Exchangeable pairs and multivariate normal approximation 

The approach of univariate normal approximation by exchangeable pair couplings in Stein's 
method has a long history dating back to the monograph [Ste86] by Charles Stein in 1986. 
In order to show that a given real valued random variable W is approximately normally 
distributed, Stein proposes the construction of another random variable W on the same 
probability space as W such that the pair (W, W) is exchangeable, i.e. satisfies the relation 

(W, W) = (W, W), and such that there exists A GjO, 1[ for which the linear regression prop- 
erty W\W — W'\W] = —XW holds. In this situation, the distance between W and a standard 
normal Z can be bounded in various metrics, including Wasserstein's and Komogorov's. 
The range of examples to which this technique could be applied was considerably extended in 
the work |RR97j of Rinott and Rotar who proved normal approximation theorems allowing 
the linear regression property to be satisfied only approximately. Specifically, they assumed 
the existence of a "small", random quantity R such that W\W — = —XW + R. 

In |CM08j . Chatterjee and Meckes proposed a version of exchangeable pairs for multivariate 
normal aproximation. For a given random vector W = (Wi, . . . , Wd) T they assume the 

existence of another random vector W = (W[, . . . , W' d ) T such that W = W and of a 
constant XeR such that E[W - W\W] = XW. In |RR09] Reinert and Rollin investigated 
the more general linear regression property K[W' — W\W] = —AW + R, where now A is 
an invertible non-random d x d matrix and R = (R\, . . . , Rd) T is a small remainder term. 
However, in contrast to Chatterjee and Meckes, Reinert and Rollin need the full strength of 
the exchangeability of the vector (W 7 ,!^'). Finally, in |Mec09j . Elizabeth Meckes reconciled 
the two approaches, allowing for the more general linear regression property from [RR09J 
and using sharper coordinate-free bounds on the solution to the Stein equation suggested by 
those from jCM08j . 

Both [CM08] and |Mec09] contain an "infinitesimal version of Stein's method of exchangeable 
pairs", i.e., they provide error bounds for multivariate normal approximations in the case 
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that for each t > there is an exchangeable pair (W, Wt) and that some further limiting 
properties hold as t — > 0. It is such an infinitesimal version that will be applied in what 
follows. 

The abstract multivariate normal approximation theorem provided by Meckes will be suf- 
ficient for the special orthogonal and symplectic cases of the present note, and the unitary 
case needs only a slight extension via basic linear algebra. So it is not really necessary to 
explain how these approximation theorems relate to the fundamentals of Stein's method (see 
[CS05J for a readable account). But it is crucial to understand that there are fundamental 
differences between the univariate and multivariate cases. So a few remarks are in order. The 
starting point of any version of Stein's method of normal approximation in the univariate 
case is the observation that a random variable W is standard normal if, and only if, for all 
/ from a suitable class of piecewise C 1 functions one has 

E[f(W) - Wf{W)} = 0. 

The quest for an analogous first order characterization of the multivariate normal distribution 
has proven unsuccessful, so Chatterjee and Meckes work with the following second-order 
substitute (see |M ec09l Lemma 1]): Let £ be a positive semi-definite d x <i-matrix. A d- 
dimensional random vector X has the distribution iV(0, E) if, and only if, for all / £ Cj?(lR d ) 
the identity 

E [(Hess /(X), S) HS - (X, V/(X)>] = 

holds (where HS stands for Hilbert-Schmidt, see below). Among the consequences of this 
is that the multivariate approximation theorems are phrased in Wasserstein distance (see 
below) rather than Kolmogorov's distance 

sup Lt(] — oo,t\) — v(] — oo,£])| 

for probability measures fi, v on the real line, i.e., the distance concept in which Fulman's 
univariate theorems are cast. 

For a vector x £ M. d let ||x||2 denote its euclidean norm induced by the standard scalar 
product on R d that will be denoted by (•, •). For A, B £ R dxk let (A,B) HS ■= Tt(A t B) = 

=i a ijbij be the usual Hilbert-Schmidt scalar product on 
M. dxh and denote by || • ||hs the corresponding norm. For random matrices M n ,M £ R fcxd , 
defined on a common probability space (Q,A,W), we will say that M n converges to M in 
L : (|| ■ ||hs) if \\M n - M ||hs converges to in L X (P). 

For A £ M. dxd let ||A|| op denote the operator norm induced by the euclidean norm, i.e., 
1 1 A 1 1 op = sup{||Ar||2 : || x || 2 = 1}. More generally, for a fc- multilinear form ip : (M. d ) k — > R 
define the operator norm 

lollop := sup {ip(ui, . . . ,u k ) : uj £ R d , \\uA\ 2 = 1, j = 1, . . . , k } . 

For a function h : M. d — > K define its minimum Lipschitz constant M\(h) by 

M 1 (h) := sup l^-y £ [0,00]. 

x^y \\X — y\\2 

If h is different iable, then M\{h) = sup xgR d \\Dh(x) || op . More generally, for k > 1 and a 
(k - l)-times differentiable h : R d ->• R let 
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x^y \\x-y\\2 

viewing the (k — l)-th derivative of h at any point as a (k — 1) -multilinear form. Then, if 
h is actually fc-times differentiable, we have Mk{h) = sup^j^ ||Z? fe /i(x) || op - Having in mind 
this identity, we define M (h) := ||/i||oo- 

Finally, recall the Wasserstein distance for probability distributions /i, v on (R d , B d ). It is 
defined by 

d w (/j,,v) := sup j| J hdy, - J hdu\ : h : R d -> R and M x (h) < l| . 

Now we are in a position to state the abstract multivariate normal approximation theorems 
that will be applied in this note, starting with the real version, taken from |Mec09t Thm. 4]. 
Z = (Zi,...,Zd) T denotes a standard rf-dimensional normal random vector, £ e R dxd a 
positive semi-definite matrix and := Y}I 2 Z with distribution N(0, £). 

Proposition 1.1. Let W,W t (t > 0) be R d -valued L 2 (P) random vectors on the same 
probability space (O, A, P) such that for any t > the pair (W, W t ) is exchangeable. Suppose 
there exist an invertible non-random matrix A, a positive semi-definite matrix S, a random 
vector R = (R\, . . . , Rd) T , a random d x d-matrix S, a sub-a-field J 7 of A such that W is 
measurable w.r.t. T and a non-vanishing deterministic function s : ]0, oo[— > R such that the 
following three conditions are satisfied: 

(i) 4yE[W t - W\F] ^ -AW + Rm L X (P). 



(ii) ^E[(W t - W)(W t - W) T \F] -—I 2AS + S in ■ || HS ). 
fiii) For each e > 0, 



lim^-E 

s (t) 



\W t - W\\ 2 l{||Wt-W||!>e} 



0. 



Then 

(a) For each h e C 2 (R d ), 

\E[h(W)}-E[h(Zx)}\ < HA-lop i^h{h)E[\\R\\ 2 ] + ^M 2 {h) E[||5|| HS ] 

(b) If £ is actually positive definite, then 

d w {W, Z E ) < IIA-lop (e[||J2|| 2 ] + -I^IIS-^IU E[||5||hs]) • 

Remark 1.2. In applications it is often easier to verify the following stronger condition in 
the place of (iii) of Proposition 11.11 

(Hi)' lim^^)E[||Wi-W||U =0. 
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Now we turn to a version of Proposition 11.11 for complex random vectors, which will be 
needed for the case of the unitary group. || • || op and || • ||hs extending in the obvious way, we 
now denote by Z = (Zi, . . . , Z d ) a d- dimensional complex standard normal random vector, 
i.e., there are iid N(0, 1/2) distributed real random variables X 1 ,Y 1 , . . . , X d ,Y d such that 
Zj = Xj + iYj for all j = 1, . . . , d. 

Proposition 1.3. Let W,W t (t > 0) be C d -valued L 2 (P) random vectors on the same 
probability space (Q, A, P) such that for any t > the pair (W, W t ) is exchangeable. Suppose 
that there exist non-random matrices A, £ <G C dxd , A invertible, £ positive semi- definite, a 
random vector R G C d , random matrices S,T G C dxd , a sub-a-field T of A such that W 
is measurable w.r.t. T , and a non-vanishing deterministic function s : ]0,oo[— > R with the 
following properties: 

(i) ^E[W t - W\F] ^ -AW + Rm L^P). 

(ii) [(W t - W)(W t - W)*\T] ^ 2AS + S m L\\\- || hs ). 
[(W t - W)(W t - Wf\F] ^Tin L\ 



m 



(iv) For each e > 0, 



lim 



-E 



Then, 
(a) 



t^o s(t) 

If he C 2 (R 2d ), then 
\Eh(W) -E/i(S 1/2 Z)| < ((A" 1 



W t -W\ 



us 



\Wt-W\\l>e} 



0. 



I op 



Afi(A)E||fl|| 2 



Vd 



E[||5| 



HS 



\T\ 



HSj 



(b) // S is actually positive definite, then 

d w (W,T}l 2 Z) < \\A-% P fE||i2|| 2 + -i=||E- 1 / 2 || pE[||S||Hs+l|T||Hs" 

V y In 



Remark 1.4. 

(i) Remark 11.21 above also applies to the present condition (iv). 

(ii) If Y is a centered d- dimensional complex normal random vector, then there exist 
A G <C dxd and a complex standard normal Z G C d such that Y = AZ. In this 
case, E[YY T ] = AE[ZZ T ]A T = 0, since E\ZZ T \ = 0. This heuristically motivates 
condition (iii) above for W close to such a Y , if one thinks of T as a small remainder 
term. For an algebraic rationale see ()2]) below. 



Proof of Proposition \1.3l The proof is by reduction to the real version in Proposition 11.11 
To this end, we need some preparation. To each z — (zi, . . . , zj) 

z R := (Re(>i), ImOi), . . . , Re(z d ), lm(z d )) T 



]T G C d assign the vector 

TED 2d 



Given A = (a jk ) G C dxd define A K G 



>2dx2d 



uniquely by requiring that (Az)m = A^z^ for 



all zeC In concrete terms, A^ consists of 2 x 2 blocks, where the block at position (j, k) 



is given as 



Re(a jk ) 
Im(a ifc ) 



lm(a jk ) 
Re(a jk ) 
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Observe that 

(AB) R = A R B R , (A*) R = (A M ) T , and {AA*) R = A R (A*) R = A R (A^ 

and that for d = 1 one has to specify whether a scalar is to be interpreted as a vector or 

10 
-1 



as a matrix. Writing J for the Kronecker product I \ A 1 ® I<f, one easily verifies the 



following identity of real 2d x 2d matrices: 

-(z 

2 V y 2 



hzw*) R + hzw T ) R J. (2) 



Now, given the data from Proposition [L3l we will show that the pairs (W Rl (W t ) R ) satisfy the 
assumptions of Proposition 11.11 with A R for A, £' := |S K for S, R R for R and |(Sr + T R J) 
for S. Clearly (W R , (W t )u) is exchangeable and A R is invertible. By ([1]), E' is symmetric and 
non- negative definite. To verify assumption (i), observe that 

1 -E[(W t - W) R \F] = -^E[(W t - W)\J=] R 



s(t) " 1 ^ J a (t)" 



For (ii), by §2§ above, we have 
1 



8(t) 



AW + R) R = -A R W R + R% 
n(W t - W) R (W t - W)l\F] 



1 \n{W t - W){W t - W)*\F} R + lE[{W t - W)(W t - W) T \F} R J 



i->0 



s{t) V2 LV /K 2 

i(2AS + S) R + ^T R J = A R Ti R + i(5 K + T R J) 

= 2A M S' + i(5 R + T M J). 

Since — W / )r||2 = — W\\2, condition (hi) is trivially satisfied. So we may apply 
Proposition 11.11 to the real version We, of W. Note that if Z is distributed according to 
N(0,l2d), then (S') 1//2 Z is equal in distribution to (Y}/ 2 Z) R . Observing that ||A _1 || op = 
IIAilop, ||5r||hs = v^HSHhs, ||T r J||hs = ||T R || H s = \^||^||hs, Ep|| 2 = E||i? R || 2 and 
||(E!')- 1/2 ||pp = v^HSrIop = V^||S -1 ||op yields the result. □ 

Now we construct a family of exchangeable pairs to fit into Meckes' set-up. We do this 
along the lines of Fulman's univariate approach in |FullO] . In a nutshell, let (M t ) t > be 
Brownian motion on K n with Haar measure as initial distribution. Set M := M Q . Brownian 
motion being reversible w.r.t. Haar measure, (M, M t ) is an exchangeable pair for any t > 
0. Suppose that the centered version W of the statistic we are interested in is given by 
W = (fi(M), . . . , fd(M)) T for suitable measurable functions fx, ■ ■ ■ , fd- Defining W t = 
(fi(M t ), . . . , fd(M t )) T clearly yields an exchangeable pair (W, W t ). 



To be more specific, let t n be the Lie algebra of K n , endowed with the scalar product 
(X,Y) = Ty(X*Y). Denote by A = A Kn the Laplace-Beltrami operator of K n , i.e., the 
diffential operator corresponding to the Casimir element of the enveloping algebra of l n . Then 
(M t )t>o w ih be the diffusion on K n with infinitesimal generator A and Haar measure as initial 
distribution. Reversibility then follows from general theory (see [HelOOj Sec. II.2.4], [IW89, 
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Sec. V.4], for the relevant facts). Let (T t ) t > , often symbolically written as (e* A ) f>0 , be the 

corresponding semigroup of transition operators on C 2 (K n ). From the Markov property of 
Brownian motion we obtain that 

E[f(M t )\M] = (T t f)(M) a.s. (3) 

Note that (t,g) i— >■ (T t f)(g) satisfies the heat equation, so a Taylor expansion in t yields the 
following lemma, which is one of the cornerstones in Fulman's approach in that it shows that 
in first order in t a crucial quantity for the regression conditions (i) of Propositions 11.11 and 
11.31 is given by the action of the Laplacian. 

Lemma 1.5. Let f : K n -t C be smooth. Then 

E[f(M t )\M] = f(M) + t(A/)(M) + 0(t 2 ). 

Remark 1.6. An elementary construction of the semigroup (T t ) via an eigenfunction expan- 
sion in irreducible characters of K n can be found in |Ste70j . This construction immediately 
implies Lemma [T31 for power sums, see Sec.[2J since they are characters of (reducible) tensor 
power representations. 

Remark 1.7. As K n is compact, Lemma Ol implies that lim t _ >0 E[/(M t ) - f(M)\M) = 
a.s. and in L 2 (P). Arguments of this type will occur frequently in what follows, usually 
without further notice. 



2. Power sums and Laplacians 

For indeterminates X\, . . . , X n and a finite family A = (Ai, . . . , A r ) of positive integers, the 
power sum symmetric polynomial with index A is given by 

r n 
k=l j=l 

For A G C nxn with eigenvalues ci, . . . ,c n (not necessarily distinct), we write p\{A) in the 
place of p\(ci, . . . , c n ). Then one has the identity 

r 

Px (A) = l[Tr(A x *). 

k=l 

Using this formula, we will extend the definition of p\ to integer indices by po(A) = Tr(J), 
p„ k {A) = Tr((^) fc ) and p x (A) = U^iPM)- 

In particular, the p\ may be viewed as functions on K n , and the action of the Laplacian A^„ 
on them, useful in view of Lemma [1.51 is available from [Rai97j ILev08j . We specialize their 
formulae to the cases that will be needed in what follows: 
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Lemma 2.1. For the Laplacian Au n on U n , one has 



3-1 

(*) A v n Pj = -njpj - j^2pi,j-i- 

i=i 

3-1 

(it) A\j n Pj,k = ~n(j + k)p j:k - 2jkp j+k - jp k ^2pi,j-i 

i=i 

k-l 

- kpj^pi^i. 
i=i 

3-1 

(Hi) A Un (p^) = 2jkpj_ k - n(j + k)pjp^ - fpi ^ PiJ-i 

i=i 

k-l 



k pjJ2 



pi,k~i- 



i=i 



Lemma 2.2. For the Laplacian Aso n on SO r 



/ i\ -3-1 ■ 3-1 

(i) A SOnPj = — JPj - - 2^Pi,j-i + g l^VVL-j- 

i=i i=i 



/ • • \ a (n - l)(j + k) j i4 k ^4 

(**) ^so„Pj,fc = ^ Pj,k ~ 2 PkZ^Ph3-i ~ g Pj 2^^ fc - ; ~~ ^Pi+fc 

2=1 J=l 
j-1 ^ fc-1 

+ 2 P fe 5Z^'- 2i + 2 Pj ^2 Pk - 21 + i k Pj~ k - 
i=i i=i 

Lemma 2.3. For the Laplacian Ausp 2n on USp 2n , 

(r, , i\ ■ 3-1 ■ 3-1 

/ -\ a 2n + 1 . J v— 1 J\7 

(i) A USPan Pj = ^ JPj - ^2^Pi,j-i - 2 Z^P 2i -J- 

i=i i=i 

(ii) A VSp2n Pj,k = -— y— " C? + fc ) Pi,* - J k Pj+k ~ \ Pk^Pi,j-i - \ Pj ^2pi,k-i 

i=i i=i 

o-i ^ k-l 

- 2 Pk ^Pi- 21 ~ 2 Pj J2 Pk ~ 21 + ^kpj-k- 
i=i i=i 

In what follows, we will need to integrate certain p\ over the group K n . Thus we will 
need special cases of the Diaconis-Shahshahani moment formulae that we now recall (see 
[DS941 IHR031 iPVOil ISto05] for proofs). Let a = (a u . . . , a r ), b = (b h . . . , b q ) be families of 
nonnegative integers and define 
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1 if aj = 0, 

■ if jaj is odd, aj > 1, 

fa(j) ■— \ jaj/2^ a . _ -Qjj j j g Q( J ( J anc j a . j g everL; > 2 ; 

i + Ei a i /2J j d (Z) ( 2d is even > % > L 

Here we have used the notation (2m — 1)!! = (2m — l)(2m — 3) • . . . • 3 • 1. Further, we will 
write 

r q s 

ka ■= A; b := J^jbj, and m := j J' || ^ * ^ 

3=1 3=1 

Lemma 2.4. Let M = M n be a Haar- distributed element of U n; Zi,...,Z r iid complex 
standard normals. Then, if k a ^ k b , 

«(o,6) := E (nW))^ n(Tr(M^)) 6 A = 0. 

\j=i j=i J 

If k a = k b and n > k a , then 

a (a , b) = 5 a , b f[p aj \ = E (f[(y/jZjp flUjZj) 1 

3=1 \3=1 3=1 

Lemma 2.5. If M = M n is a Haar- distributed element of SO„, n — 1 > k a , Z\, . . . , Z r iid 
real standard normals, then 

(r \ / r \ r 

U(Tr(M')p = E (IKVjZj + rijp) =Y[f a (j). (4) 
3=1 J \3=1 / 3=1 

Lemma 2.6. If M = M 2n is a Haar- distributed element of USp 2n , 2n > k a , Z ± , . . . , Z r iid 
real standard normals, then 

E (f[(Tr(Mi))A = E (f[(^jZj - Vj )*A = f[(-l)^ f a (j). (5) 

\j=l / \j=l J 3=1 



3. The unitary group 

Let M = M n be distributed according to Haar measure on K n = U n . For d e N, r — 1, . . . ,d, 
consider the r-dimensional complex random vector 

W := W(d, r, n) := (Tr(M d ~ r+1 ), Tr{M d - r+2 ), Tr{M d )) T . 

Let Z := (Z d _ r+ i, . . . , Z d ) T be an r-dimensional standard complex normal random vector, 
i.e., there are iid real random variables Xd- r +i, ■ ■ ■ , X d , Y^-r+i, • • • , Yd with distribution 
N(0, 1/2) such that Zj = Xj + iYj for j = d — r + 1 , . . . , d. Furthermore, we take £ to denote 
the diagonal matrix diag(<i — r + 1, d — r + 2, . . . , d) and write Z-£ ■— Y}I 2 Z. The present 
section is devoted to the proof of the following 
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Theorem 3.1. If n> 2d, the Wasserstein distance between W and is 



d w (W, Z E ) = O I I . (6) 



In 'particular, for r = d we have 

d w (W, Zv) = o ' 

and /or r = 1 

Ifl<r= [cd\ for < c < 1, tfien 



n 



d 3 / 2 



n 



d w (W,Z E ) = 0( — ). 



n 



Remark 3.2. The case r = 1 means that one considers a single power Tr(M ). In his study 
[FullO] of the univariate case, Fulman considers the random variable Tr ^ - instead. By the 
scaling properties of the Wasserstein metric, the present result implies 

So in this special case we recover the rate of convergence that was obtained by Fulman, 
albeit in Wasserstein rather than Kolmogorov distance. 



In order to prove Theorem 13.11 we invoke the construction that was explained in Section 
[I] above, yielding a family (W, W t )t>o of exchangeable pairs such that W t = W t (d,r,n) = 
(Tr(M?- r+1 ), Tr(Mf~ r+2 ), . . . , Tr(M 4 d )) T . We have to check the conditions of Prop. O As 
to (i), from Lemma [1.51 and Lemma [2.1( i) we obtain 

E[ P] (M t )\M] = Pj (M)+t(A Pj )(M) + 0(t 2 ) 

3-1 

= Pj (M) - tnj Pj (M) - tjJ2Pid-i( M ) + °( t2 )' 

i=i 

hence 

EfoiMt) - Pj (M)\M] = -t Lj Pj (M) + J J^ Plj _ l (M) + 0(t) 
From this we see that 

/ E\p d _ r+l {M t ) - Pd _ r+l {M)\M\ \ 



jE[W t - W\M\ = \ 



E\p d „ r+2 (M t ) - p d ^ r+2 (M)\M] 
V E\p d (M t )- Pd (M)\M] J 
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tends, as t — > 0, a.s. and in L 1 (|| • || H s) to 



/ (d-r + 1) 
(d-r + 2) 



np d _ r+2 + E?=[ +1 Pl,d-r+2-l (M) 



d 



nPd + Yfi=lviA-i ( M ) 



-AW + R, 



J 



where A = diag(nj : j = d — r + 1, . . . , d) and R = (Rd- r +i, • • • , Rd) T with Rj 
—j Xw=i Pi,j-i(M)- (See Remark 11.71 above for the type of convergence.) 

To verify conditions (ii) and (iii) of Prop. 11.31 we first prove the following lemma. 
Lemma 3.3. For j, k = d — r + 1, . . . , d, one has 
(i) 

E[( Pj (M t )- Pj (M))( Pk (M t ) - Pk (M) | M\ = -2tjk Pj+k (M) + 0(t 2 ) 

(ii) 

E[( Pj (M t ) - Pj (M))(j%(M t ) - p-k{M)) | M] = 2tjkp^ k (M) + 0(t 2 ) 
Proof. By well known properties of conditional expectation, 

E[( Pj (M t ) - Pj (M))(p k (M t )- Pk (M))\M] 
= E[p,(M t )p k (M t )\M] - Pj (M)E\p k (M t )\M] - p k (M)E[p 3 (M t )\M] + Pj (M) Pk (M). 
Applying Lemma 11.51 and 12.1( h) to the first term yields 

E\p jtk (M t )\M] = Pjik (M) + t(A Pjjk )(M) + 0(t 2 ) 

= Pj, k (M) + t i-n{j + k) Pj , k (M) - 2jk Pj+k {M) - j Pk (M)J2PU-i( M ) 



1=1 



k-l 



-k Pj (M)J2pi,k-i(M)\ + 0(t 2 ). 
i=i J 

Analogously, for the second term, 

Pj(M)E[p k (M t )\M] = Pj (M)(p k (M) + t(A Pk )(M) + 0(t 2 )) 

= p hk {M) + t P j(M) ^-nk Pk (M) - kJ2Pi,k-i{M)^j + 0(t 2 ) 

k-l 

= p hk {M) - tnk Pj>k (M) - tk Pj {M) J2Pi,k-i(M) + 0(t 2 ), 



i=i 



and by symmetry 



Pk {M)E\p 3 {M t )\M\ = Pj , k (M) - tnj Pjik (M) - tjp k {M) ^ PhM {M) + 0(t 2 ). 
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Summing up, one obtains 

E[( Pj (M t ) - Pj (M))( Pk (M t ) - Pk (M))\M] = -2tjk Pj+k (M) + 0(t 2 ), 
proving the first assertion. For the second, we compute analogously 
E[( Pj (M t )- Pj (M))(p^(M t )-pj-(M)\M] 
= E\p 3 Tk(M t )\M]- Pj (M)E^(M t )\M} -p^(M)E[ Pj (M t )\M] + Pj p^M), 
and we have by Lemma I2.1( iii) 

E\p jPk -(M t )\M} = PjPk -(M) + tA(pjpj-)(M) + 0(t 2 ) 
= PjPk ~(M) + t (2jk Pj - k - n(j + k) PjPk ~(M) 

i-i fc-i \ 

-ffi(M) Y,PU-i( M ) - k P j(M) Y,Pv^i{M) + 0(t 2 ) 



1=1 



as well as 



Pj (M)E[p- k (M t )\M] = Pj {M) fa{M) + *(Aj*)(M) + 0(t 2 )) 
PjPi(M) + t Pj (M) (-nkpk(M) - kJ2Phk^i(M) \ + 0(t 2 ) 



1=1 



and 



P j(M)E\p j (M t )\M] = Tk{M) ( Pj (M) + t(A Pj )(M) + 0(t 2 )) 

= PjPk -{M)+t Pk -{M) ^-nj Pj {M)-jJ2Pid-i( M )j +0(t 2 ). 

Summing up, one has 

E[(pj(M t ) - P] {M)){T k {M t ) - Tk{M))\M] = 2tjk Pj - k + 0(t 2 ). 



□ 



Now we are in a position to identify the random matrices S, T of Prop. 11.31 By Lemma 
[3731 \E[{W t - W){W t - W) T \M] converges almost surely, and in \}{\\ ■ || HS ), to T = 
{tj k )j, k =d-r+i,...,d, where t jk = -2jkp j+k (M) for j,k = d - r + 1, . . . , d. Observing that 
AS = diag(nj 2 : j = d - r + 1, . . . , d), one has that ~E[(W t - W)(W t - W)*\M] converges 
almost surely, and in L 1 (|| • ||hs), to 2AS + S, where S = (sj k )j ;k =d-r+i,...,d is given by 



0, if j = k, 

2jk Pj „ k (M), aj^k. 

Next we will verify condition (iv), using Remark 11.41 Specifically, we will show that E[||W| — 
Wf 2 } = 0(t 3 / 2 ). Since 

d 

E[\\W t -W\\l] < MWtj-WjXW^-WkXWtj-Wdl] 

j,k,l=d-l+l 
d 

< ( E Vi W U - W,\ 3 ]E[\W ttk - W k \ 3 ]E[\W t>l - ^| 3 ]) V3 , 

j,k,l=d— r+1 
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it suffices to prove that E[\W tJ - Wj\ 3 } = 0(t 3/2 ) for all j = d - r + 1, . . . , d. This in turn 
follows from the next lemma, since 



2n\l/2 



Lemma 3.4. For j = d — r + 1, . . . , d, n > 2d, 

(i) E[\W tij -W j \ 2 ] = 2j 2 nt + 0(f), 

(ii) E[|W^-J^| 4 ] = 0(t 2 ). 

Proof. By Lemma 13.31 (ii). 



E[\W t;j - W,\ 2 } = E[{ Pj {M t ) - Pj {M)){p]{M t ) - pj(M))] = E[2tfn + t 2 )} = 2tj 2 n + 0(t 2 ), 



establishing (i). Turning to (ii), one calculates that 



= E [(W t>j - Wj) 2 • (Wy - W 3 ) 2 ] 

= E [( Pj (M t ) - p 3 (M)) 2 (pj(M t ) - p](M)) 2 ] 

= E [(p 2 (M t ) - 2 Pj (M) Pj (M t ) +p 2 (M))(p- 2 (M t ) - 2pj(M)p](M t ) + p] 2 (M))} 
= E [p 2 (M t )p] 2 (M t )] - 2E [p 2 (M t )p-(M t )p-(M)] - 2E [pj 2 (M t ) Pj (M t ) Pj (M)] 
+E [p 2 (M t )p] 2 (M)] +E[p] 2 (M t )p 2 (M)] +4E[p J (M t )p-(M t )p 3 (M)p-(M)} 
-2E [p 2 (M)p-(M)p-(M t )] - 2E [p- 2 (M)p 3 (M)p 3 (M t )] + E [p 2 (M)p- 2 (M)] 
=: S 1 - 2S 2 - 2S 3 + S 4 + S 5 + AS 6 - 2S 7 - 2S 8 + S 9 . 



By exchangeability, we have Si = Sg, S 3 = S 2 , £4 = S 5 , Sj = S 2 , S 8 = Sj = S 3 . 

Now, for n > 2d, i.e., large enough for the moment formulae of Lemma [2.41 to apply for all 

j = d — r + 1, . . . ,d, 



£1 = 2j 2 , 

S 8 = E[p- 2 (M) Pj (M) Pj (M t )] =E[p- 2 (M)p J (M)E[p 1 (M t )\M]] 

= E [p- 2 (M)p,(M) ( Pj (M) + t(A Pj )(M) + 0(t 2 ))] 

= E [p- 2 {M)p 2 {M)] + E [p- 2 {M)p {M) 0(t 2 )] 

/ i-i 
p- 2 {M)p 1 {M) -nj Pj {M)-jJ2Pi,j-i(M) 



+ t E 



1=1 



j'-i 



2j 2 + 0(t 2 ) - tn 3 2j 2 -tj^E W{M) Pj {M) PhM {M)] = 2j 2 - 2tnf + 0(t 2 



1=1 
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Hence, S 3 = S 2 = ~th = S 8 = 2j 2 - 2tnj 3 + 0(t 2 ). On the other hand, 
S, = E[p%M t )p^\M)]=E[p- 2 (M)E[p 2 J (M t )\M}} 

= E [pJ 2 (M) (p 2 (M) + t (ApJ)(M) + 0(t 2 ))] = E [p- 2 (M)p 2 (M)} + 0(t 2 ) 

+ t E Y 2 {M) (^-2njp 2 (M) - 2fp 2j (M) - 2jpj (M) E Pij-i (M) 

3-1 

= 2f - Antf - 2tfE [p- 2 (M)p 2j (M)} - 2tj [p- 2 (M)p,(M)p lJ _ l (M)} + 0(t 2 ) 



1=1 

;1 /i+„„-3 i r\l+2\ 



= 2j 2 -Atnf + 0(r). 
Finally, 

S 6 = E[p J (M t )p-(M t )p J (M)p-(M)]=E[p J (M)p-(M)E[p^ 
= E [ Pj (M)pj(M) ( Pj (M)p-(M) + t (A Pj p-)(M) + 0(t 2 ))] 



E [p 2 {M)p- 2 {M)} + 0{t 2 ) + t E 



Pj {M)p-{M) 



2j 2 n - 2nj Pj {M)p](M) - JPj (M)Y,P%Fi(M) - jp](M) J2pij-i( m ) 

i=i i=i 
= 2j 2 + 2tfnE \pj(M)pj(M)] - 2ntjE [p 2 (M)p- 2 (M)] 
j-i j'-i 
-tj J2 E [p%M)p-(M)W-i(M)} - tj E E [PjWPij-iWpfiM)] + 0(t 2 ) 
i=i i=i 
= 2j 2 + 2tnf - intf + 0{t 2 ) = 2j 2 - 2ntf + 0(t 2 ). 

Putting the pieces together, 

E [\W t j - Wj\ A ] = 2 -j 2 - 8(2j 2 - 2tnf) + 2(2j 2 - Atnf) + 4(2j 2 - 2ntf) + 0(t 2 ) 
= j 2 (4 - 16 + 4 + 8) + tnfilQ - 8 - 8) + 0(t 2 ) = 0(t 2 ). 
as asserted. □ 
With the conditions of Theorem 11.31 in place, we have 



d w (WX /2 Z)<\\k- l \\ op fE||i2|| 2 + -i=||S- 1 /2|| op E[||5||Hs+||T||Hs 

V V 27T 



(7) 



To bound the quantities on the right hand side, we first observe that ||A || op = n( - d „ r+1 ) and 

lollop = 7ra- Now 

d d j-i 

n\R\\i= E E i 2 E E ^M&^wi- 

j=d— r+1 j=d— r+1 (,m=l 

For n > d, Lemma 12.41 implies 

E[ Ri _ ; (M)p^— (M)] = <f J>m E fpjj-i(M)pj3= 7(M)] 
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and 



Hence 



E| P y_ l (M)Py=i(M)] = J 2/(j Z) ' ' ' 



l(j — I), otherwise J < ^' 



3-1 



e\\r\\i < 2 j 2 £'0'-o 

j=d-r+l J=l 
j=d— r+1 

= 5 E 

j'=d— r+1 

fc=i 
1 r 

= - J](A; 5 + 5k A (d - r) + 10A; 3 (d - r) 2 + 10A; 2 (d - rf + 5A;(d - r) 4 + (d 
k=l 
1 r 

- - J2( k3 + 3A;2 ( rf - r ) + 3k (d ~ rf + (d - rf) 
3 A 

£fc5 + ( d _ r ) ( 5 J> 4 -3j> 2 

_k=l V fc=l fc=l 

/ r r \ 

+ (d-r) 2 10^A; 3 -3^A; + (d - rf ( 10 ^ A; 2 - r 



1 

3 



fe=i 



fc=i 



fc=i 



+ (d-r) 4 5^A; + r(d 



fc=i 



1 



< - [r 6 + 5(d - r)r 5 + 10(d - r) 2 r 4 + 10(d - r) 3 r 3 + 5(d - r) 4 r 2 + (d - r 
3 

= 0(max(r 6 ,r(d - r) 5 )), 
hence E||i2|| 2 < v^Pl = 0(max(r 3 , ^¥(d - r) 5 / 2 )). On the other hand, 



n\s\\h = E E i a i*i 2 

j,k=d— r+1 

= 8 ^ j 2 A; 2 E[^ fe (M)p-^(M)] = 8 £ fk 2 (j-k 



d-r+l<k<j<d 



d-r+l<k<j<d 



16 CHRISTIAN DOBLER AND MICHAEL STOLZ 

= § E ^ 2 - 8 E ^ 

d-r+l<k<j<d d-r+l<k<j<d 

d-r+2 d-r+2 

= X E \f + Q f( d ~ r) + I5f(d - rf + 20f(d - rf + I5j 2 (d - rf + 6j(d - rf + (d - rf] 

6 3=2 

< r 7 + 6r 6 (d - r) + 15r 5 (d - r) 2 + 20r 4 (d - rf + 15r 3 (d - r) 4 + 6r 2 (d - r) 5 + r(d - rf 
= 0(max(r 7 , r(d — r) 6 )). 

So we have obtained that 



E||5||hs < sJnSWls = 0(max(r 7 / 2 , ^{d - rf)). 

As to ||T||hs, for n > 2d we have 

d d 
E[||T|| 2 S ] = n\t ik \ 2 }= E 4j 2 A; 2 E^ +fc (M)^(M)] 

j,k=d— r+1 j,k=d—r+l 

= E 4^+*) = 4 E ^ 2 + 4 E *v 

j,k=d—r+l j,k=d—r+l j,k=d~r+l 

d \ / d 

E i'l E * 2 

\j=d— r+1 / \j=d— r+1 

= 8(^(j + rf-r) 3 ) (£(* + d- r ) : 

\i=l / \fc=l 

= 8 (Ej 3 + 3j 2 (rf-r) + 3j(rf-r) 2 + (rf-r) 3 ) ( ^ A; 2 + 2A;(d - r) + (d-rf 

\j=l J \k=l 

< 8(r 4 + 3r 3 {d - r) + 3{r 2 {d - r) 2 + r(d - r) 3 )(r 3 + 2r 2 (d - r) + r(d - rf ) 
= 0(max(r 7 ,r 2 (d - rf )). 

hence 

E||T||hs = 0(max(r 7/2 , r(d - rf' 2 ). 
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Plugging these bounds into (J7|), we obtain 
d w {W,E 1/2 Z) 

O (max(r 7 / 2 , y/r(d — r) 3 ) + max(r 7//2 , r(d — r) 5 / 2 ) 



< 



( O (max(r 3 , y/f(d - r) 5/2 ) + 

- r + 1) \ ' 



n(d-r + l)\ ' \/d-r+l 

y/d — r + lmax(r 3 , y/r(d — r) 5 / 2 ) + max(r 7//2 , y/r(d — r) 3 ) + max(r 7//2 , r(d — r) 5 / 2 } 



O 



n{d-r + iy/ 2 
In the case that r < d — r, 



© = o 

whereas in the case that d — r < r one obtains 

(El) = O 



y/r(rf-r + l)(rf-r) 5 / 2 \ _ f ^/¥(d-r) 3 / 2 
n(d — r + 1) 3 / 2 J \ n 



r 7/2 

n(d-r + l) 3 / 2 7 ' 
This yields the first claim, and the others follow easily. 

4. The special orthogonal group 

Let M = M n be distributed according to Haar measure on K n = SO n . For d G N, r = 
1, . . . , d, consider the random vector W := W(d, r, n) := (/ d _ r+ i(M), f d ^ r+2 (M), . . . , f d (M)) T , 
where 



Pi, j odd 
Pj — 1, j even. 



Let Z = (Zd-r+i, ■ ■ ■ , Zd) T denote an r-dimensional real standard normal random vector and 
write Zy, ■= S 1 / 2 Z, where £ := diag(c? — r + l,d — r + 2,...,d). The objective of this section 
is to prove the following 

Theorem 4.1. Ifn > 4d+l ; t/ie Wasserstein distance between W and Z-% is of the same 
order as in the unitary case, namely 



d w {W, Z E ) = O 



' max {(^TTF72' {d-rf/ 2 ^ 



n 



To prove Theorem 14. 1[ we invoke a family (W, W t )t>o of exchangeable pairs such that W t '■— 
W t {d, r, n) := (f M (M t ), f M (M t ), f d {M t )) T for all t > 0. We will apply Prop. EU 
so the first step is to verify its conditions. For condition (i) we will need the following 

Lemma 4.2. For all j = d - r + 1, . . . ,d 

E[W t>j - Wj\M] = E[f,.(M t ) - fj{M)\M] = t ■ (- {n ~ 1)j f 3 {M) + R 3 + 0(f) J , 
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where 

■ 3-1 ■ 3-1 

R i = - J nJ2P l '3~i( M ) + ij2P^ M ) if j is odd, 
i=i i=i 

R j = J-^^-l^p^^ + l^p^M) if j is even. 

i=i i=i 

Proof. First observe that always fj(M t ) — fj{M) = pj(M t ) — pj(M), no matter what the 
parity of j is. By Lemmas 11.51 and 12.21 



E[p 3 (M t ) - Pj (M)\M] = t(A Pj )(M) + 0(t 2 ) 



1=1 1=1 



which is 

3-i ■ 3-1 



t -^^fAM) + {- J - J2pi,3-i(m) + \ J>^( M )) + °(*) 



(n - l)j 
if j is odd and which is 



t(-^^MM) + ("^^ - f X>,^(M) +fi>- 3 -(M)) + 0(t) 

\ Z=l 1=1 / 

if j is even. This proves the lemma. □ 
From Lemma 14.21 (see also Remark 11.71) we conclude 

-E[W t - W\M\ ^ -AW + R almost surely and in L X (P) , 

t 

where A = diag , j — d — r + 1, . . . , d\ and R = (Rd- r +i, • • • , Rd) T ■ Thus, condition 

(i) of Prop. 11.11 is satisfied. In order to verify condition (ii) we will first prove the following 

Lemma 4.3. For all j,k — d — r + 1, . . . ,d 

E[( Pj (M t ) - Pj (M))(p k (M t ) - Pk (M))\M] =t{jk Pj - k (M) - jkp J+k (M)) + 0(t 2 ). 

Proof. By well-known properties of conditional expectation 

E\(p 3 (M t )-p 3 (M)){p k (M t )-p k (M))\M] 
= E\p jtk (M t )\M\ - Pj (M)E\p k {M t )\M] -p fc (M)E[p i (M t )|M] + Pj , k (M) (9) 
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By Lemmas 11.51 and part (ii) of Lemma 12.21 we see that 
E[ Phk (M t )\M] = p jtk (M) + t (Ap j>k ) (M) + 0(t 2 ) 



Pj,k(M) +t(- {n 1} 2 ° + k) Pj , k (M) - J - Pk (M) J^Pu-iiM) - \p 3 {M) J2m-i(M) 



i=i i=i 



k 



- kjp j+k (M) + 3 -p k {M) Y,Pj-2i{M) + - Pj {M) Y,Pk-2i{M) + jk Pj . k (M)^ 



i=i i=i 

t2\ 



+ 0(t 2 ) (10) 
Also, by Lemma [1.51 and part (i) of Lemma [2.21 

- Pj (M)E\p k (M t )\M] = —pj(M) ( Pk (M) + t (A Pk ) (M) + 0(t 2 )) 
= - Pjik (M) - t Pj (M) -L^ Pfc (M) - -^PiM-m + 1 5>,_ fe (M) + 0{f) 

\ ~~ 1=1 1=1 / 

(tl h ^ — i 

= -Pj, k {M) + 1 2 Pj,h{M) + t- Pj (M)J2pi,k-i( M ) 

i=i 

b k ~ l 

- t-p,(M)J2P2i-k(M) + 0(t 2 ), (11) 



T 

i=i 



and for reasons of symmetry 

-p k (M)E[ Pj (M t )\M] = -p j>k (M) + t {n ~ 1)j p^(M) + t 3 - Pk (M) £> y _ { (M) 



i=i 



- t J - Pk (M)J2P2i-A M ) + 0(t 2 ). (12) 
i=i 

Plugging (HDD, (HU) and (H2D into © and noting that for j E N 



p^j(M) = Ti(M- j ) = Tr ((M T ) j ) = Tr ((AP) T ) = Tr(M J ) = p^-(M) , 
we see that many terms cancel, and finally obtain 

E[( Pj (M t ) - Pj (M))(p k (M t ) -p k {M))\M\ =t(jkp 3 . k (M)-jkp J+k (M)) + 0(t 2 ) . 

□ 

Observing as above that, regardless of the parity of j, we have fj(M t ) — fj(M) = pj(M t ) — 
Pj(M), we can now easily compute 

jE[(W t , - W 3 )(W t , k - W k )\M] = \m,fj(M t ) - f 3 (M)){f k (M t ) - f k (M))\M] 

= jE[(p J (M t )-p J (M))(p k (M t )~p k (M))\M]=jkp 3 . k ^ 

jkpj- k (M) — jkpj +k (M) a.s. and in L 1 
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for all j, k — 1, . . . , d. Noting that for j = k the last expression is j 2 n — j 2 p 2 j(M) and that 
2AS = diag((n — l)j 2 , j — 1, . . . , d) we see that condition (ii) of Prop. 11.11 is satisfied with 
the matrix S = {S jik ) jjk =i,...,d given by 



S 



j 2 (l-p 2j (M)), j = k 

jkp^ k {M) - 3 k P]+k (M) } j^k. 

In order to show that condition (iii) of Prop. 11.11 holds, we will need the following facts: 

Lemma 4.4. For all j = 1, . . . , d, n > Ad + 1, 

« E[(W tJ -W J ) 2 ]=tj 2 (n-l) + 0(t 2 ). 
(ii) E{(W t , 3 -W 3 )*} = 0(t 2 ). 

Proof. As for (i), by Lemma [4.31 

n(W t>j - Wj) 2 ) = E [( Pj (M t ) - Pj {M)f] 
= E[E [( Pj (M t ) - p,(M)) 2 | M}} = E [tf (n - p 2j (M)) + 0(t 2 )} 
= tj> - 1) + 0(t 2 ) , 

since by Lemma 12 . 51 E [poj (M)] = 1. 
For claim (ii) we compute 

E [(W t>j - W 3 f] = E [(pj(M t ) — pj(M)) 4 ~\ 
= E[ Pj (M t ) 4 } + E[j9,(M) 4 ] - 4E[ Pj (M t ) 3 P:j (M)} - 4E[ Pj (M) 3 p 3 (M t )] + 6E[ Pj (M t ) 2 p 3 (M) 2 } 
= 2E[p,(M) 4 ] - 8E[ Pj (M) 3 p,(M t )] + 6E[p h ,(M)p hJ (M t )] , 

where the last equality follows from exchangeability. By Lemma 11.51 and part (ii) of Lemma 
12.21 for the case k = j 

E\p j;j (M) Pj!j (M t )} = E \p 3 ,{M)E[p hJ {M t )\M}} 
= E [p jtj {M) ( Pjij (M) + t (Ap jtj ) (M) + 0(t 2 ))] 
= E[ Pj (M) 4 ] + tE[p hj {M) ■ A Pj)j (M)} + 0(t 2 ) 

= E[^(M) 4 ] + tE Pj,j(M) ( -{n - Vui' jM) - j Pj (M)J2Pi,j-i( M ) ~ J 2 P2j(M) 
i-i 



n 
i 



v i=i 

2„ 



+ 0(t 2 ) . 



Again by Lemma 11.51 and part (i) of Lemma 12.21 

Eip^Mfp^Mt)} = E [ Pj (M) 3 E[p 3 (M t )\M]} 
= E [pj(M) 3 (( Pj (M) + t (Apj) (M) + 0(t 2 ))] 



E[ Pj (M) 4 } +tE 



Pj(M) 



(n - l)j 



■ J I ■ 3-1 \ 

pj(m) - j -J2pu-i(m) + 3 -J2P2i-AM) 

1=1 1=1 ' 



+ 0(t 2 ) 
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Therefore, 

E [{W t , 3 - W 3 f] = 2E [ P] {Mf] - 8(E[p,.(M) 4 ] - t^^E^(M) 4 ] 

. i-i . 3-1 

4 J2 E [Pi(M)*Pu-i(M)] + 4 E E b-(M) 3 p 2i -i(M)] 
i=i z=i 

+ 6 [E [p,(M) 4 ] - *(n - l)j'E [ Pl {Mf] -tj^E [ Pj (M) 3 Pltj ^(M)] 
\ i=i 

- tj 2 E [p 2j (M) Pj (M) 2 ] + tj^E [Pj(M) 3 P2i-j(M)] + tfnE [ Pj (M) 2 U + 0(t 2 ) 

i=i J 

3-1 3-1 

= -2t{n - 1)jE [ P] {MY] - 2tjJ2 E HMfpij-iiM)] + 2tj ^E [ Pj {Mf P2l ^{M)] 

i=i i=i 

- QtfE [ P2j (M) Pj (M) 2 ] + QtfnE [ Pj (M) 2 ] + 0(t 2 ). 

Case 1: j is odd. Then by Lemma [2.51 

E [(Wtj - Wj) 4 ] = -2t{n - l) 3 • 3j 2 - + - 6tf ■ j + 6tfn ■ j + 0(t 2 ) 
= 0(t 2 ), 

as claimed. 

Case 2: j is even. Then, again by Lemma [2.51 

3-i 

E [(Wtj - W 3 f] = -2t(n - l)j(l + 6j + 3j 2 ) - 2tj(l + 3j) J> k;-*(M)] 

i=l 

3-1 

+ 2tj(l + 3j)J2^\P2i-j(M)} - mf(l+j) + 6tj 2 n(l+j) + 0(t 2 ) . 
i=i 

Consider the term Yui=i ^ \Pi,j-i(M)]- Since E [pjj_j(M)] = whenever I is odd and I ^ |, 
we can write 

3-1 3/2-1 

E E biJ-«W] = E^ /2J/2 (M)] + ^ E[p 2fcj _ 2fc (M)] 

Z=l fc=l 

3/2-1 

= E [p i/2)i/2 (M)] + £ 1, 



k=l 



where the last equality follows again by Lemma [2.5[ If j/2 is odd, then, clearly, k ^ j /A is 
not really a restriction, and by Lemma 12.51 
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3-1 



2 V 2 



If j/2 is even, then by Lemma [2.51 



z=i 



E e mm)]= i+ + -2Uj-i. 



Hence, in either case we have X^=i E [pjj_j(M)] = j — 1. 
Furthermore 



3-1 



j/2-1 



E E [paj-^M)] = 2 E E [pa-j-CM)] + E [Tr(/„)] = 2 i-l +n = j- 2 + n. 



z=i 



i=i 



Therefore, if j is even 



E [(W y - H/,) 4 ] = -2tj{n - 1)(1 + 6j + 3j 2 ) - 2tj(l + 3j)(j - 1) 
+ 2tj{l + 3j)(j - 2 + n) - 6tj 2 (l + j) + 6tfn(l + j) + 0(t 2 ) 
= tj (-12(n - 1) - 2 + 6 + 2 + 6(n - 2) - 6 + 6n) 
+ tj 3 (-6(n - 1) - 6 + 6 - 6 + 6n) + 0(t 2 ) 
= 0(t 2 ), 

as asserted. □ 
Now we are in a position to check condition (Hi)' of Prop. 11.11 By Holder's inequality 



E[||Wi-W||i] = E 



E (Wtj-wA 

=d-r+l J 



3/2' 



E 



d \ 

E (W td - WtfiWtt - W k f{W tjl - wtf 

\j,k,l=d—r+l 



< e E [i(w^-wo)(^t,*-^)(^-wi)i] 

j,k,l=d— r+1 

< E ( E 0^ - ^f] E [l^i,* - E [\ w t,i - w i\ 3 }) 

j,k,l=d-r+l ^ ' 



1/3 



Thus we have 



^E [||W t - Hi 3 ,] < E (t e D^w - ^'l 3 ] 7 E - 7 E W W U ~ 



j,k,l=d—r+l 



1/3 
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and it suffices to show that for all j = d — r + 1, . . . , d 

Km~E[\W tJ -W j \ 3 }=0. 
But this follows from Lemma I4.4[ since by the Cauchy-Schwarz inequality 
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E[|W tii -^| 3 ] < y/E [\W t>j - Wtf] E [\Wtj - Wtf] = V(tf(n - 1) + (>(*•)) • 0(f) 

and hence ±E [\W tJ - Wj\ 3 ] '4° 0. 

By Prop. 11.11 we can now conclude that 



d w (W,Z s )< \\A-%JE[\\R\\ 2 



Clearly, ||A % p = (w _ 1)( 2 d _ r+1) 
first prove the following lemma. 



and ||X- 1/2 || 



°P jd-r+l ■ 



|S-^|| op e[||^||h S ; 



In order to bound E[||i?|| 2 ] we will 



Lemma 4.5. If n > Ad + 1, for all j = d - r + 1, . . . ,d we have that E[Rj] = 0(f 
Proof. First suppose that j is odd. Then 



E[i? 2 ] = E 

2 n- 1 



■ j-l ■ 3-1 



i=i 



r 



i=i 
j-i 



i,fe=i 



v Z,fc=l 



(T\ - 2T 2 + T 3 



By Lemma 12.51 for j odd 



Elpu^M^kj.^M)} 



+ if fc = 1 is odd, 

(Z + l)(j — Z), if A; = I is even, 

10'- * + 
o, 



if I is odd and k = j — Z, 
if Z is odd and k £ {l,j — /}, 



(Z + l)(j — Z), if Z is even and k = j — I, 



0. 



if Z is even and k ^ {Z, j — Z}, 



24 



CHRISTIAN DOBLER AND MICHAEL STOLZ 



Therefore, 



and 



E\p ljJ . l (M)p 2k ^ j (M)] 



E[p 2J _ j (M)p 2fe _ i (M)] 



3-1 



'I, 


if I = 2k - j, 


3-1, 


if 1 = 2] - 2k, 


< I, 


if I = j - 2k, 


j-l, 


if I = 2k, 




otherwise, 




, if I = k or I 




otherwise. 




i-i 



T x = 2^/(j-/ + 1) + 2^(/ + 1)(j-/) 



i=i 

l odd 



1=1 
I even 



< 2,^1 f(.l 1) Oi/ 



z=i 



To 



2 E i+2 EO'-o<2X)/=j(j-i) = o(j 2 ) 

z=i 



i=i i=i 

l odd Z even 



3-1 



T 3 = 2^|2/-j|<2^j = 2j 2 = 0(j 2 ). 



z=i 



i=i 



Hence for j odd 



E[i? 2 ] = ^ (0(j 3 ) - 2 0(j 2 ) + 0(j 2 )) = 0(f) . 
The case that j is even can be treated similarly as can be seen by observing that in this case 



R i 



J 



3-1 3-1 

i=i 



i=i 



But as in Case 2 of the proof of Lemma [4.41 (ii), one has to distinguish between the cases 
that j = 0mod4 or else that j = 2 mod 4, and the explicit formulae for E[i?|] are more 
complicated. □ 

By Lemma \4. 51 there is a constant C > neither depending on j nor on n such that E[i2|] < 
Cj 5 . Thus by Jensen's inequality and the binomial theorem we obtain 
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n\\Rh] < 



E m]} 



< 



--d-r+l 



\ 



j=d— r+1 



\ 



E^'+ d - 



i=i 



\/C £ (j 5 + 5/(d - r) + 10j 3 (d - r) 2 + 10j 2 (d - r) 3 + 5j(d - r) 4 + (d - r) 5 ) 



1/2 



0=1 



< VC (r 6 + 5r 5 (rf - r) + 10r 4 (rf - r) 2 + Wr 3 (d - r) 3 + 5r 2 (d - r) 4 + r((2 - r) 5 ) 



< v^v 7 ^ (max{r 6 , (d - r) 5 r}) 7 = O (max{r 3 , (d - rf' 2 ^}) . 
Next, we turn to bounding E[||S'||hs]- By Lemma [2751 we have 



(13) 



n\\3\U= E E[S»J = £ j 4 E[l-2 P2 ,(M)+ P2 ,(M) 2 ] 



j,k=d— r+1 



j=d-r+l 



+ £ j 2 fc 2 E [p,_ fc (M) 2 - 2 Pj _ k (M) Pj+k (M) + Pj+k (M) 2 } 



j,k=d—r+l 



2 E ^' 5+2 E j 2 ^ E fe'-fc( M )V 2E ^^ 

jr'=d-r+l d-r+l<fc<j<d 



Again, by Lemma [2.51 



E [ Pj - k (M) 2 } 
E\p^ k (M) Pj+k (M)] 
E [p J+k (M) 2 ] 



Hence, 



1+j — k, if j + k even, 
j — fc, if j + k odd, 

1, ]£ j + k even, 
0, if j + fc odd, 

1 + j + k, if j + k even 
j + k, ii j + k odd. 



EOl^llls] 



2 E ^' 5+2 E j 2 ^ 2 (i+j-fc-2+i+j + fc) 



j=d-r+l 



d—r+l<k<j<d 
k+j even 



+ 2 e j 2 k 2 (j-k-2-o+ 3 + k) = 2 e /+ 4 E k 

j=d-r+l d—r+l<k<j<d 



-r+l<fc<j<d 
k+j odd 



From f [T3|) we know that 



26 



CHRISTIAN DOBLER AND MICHAEL STOLZ 



d r 

f = ^2(3 + d - r f < 32 max{r 6 , (rf - r) 5 r}, 

j=d-r+l j=l 

and furthermore we can bound 

£ k 2 f< £ i 2 (i-(rf-r + l))j 3 

d-r+l<fc<j'<d j=d-r+2 
r r 

= £(j + - r) 2 (j + - r - (d - r + l))(j + d - r) 3 = £(j + rf - r) 5 (j - 1] 

3=2 J=2 



r 



— r + <^ ~ r ) 5 — 32r max{r 6 , (rf — r) 5 r} = 32 max{r 7 , (rf — r) 5 r 2 } . 



Thus, we obtain 



E[||S||h S ] < 64max{r 6 , (rf - r) 5 r} + 128max{r 7 , (rf-r)V} 
= (max{r 7 , (rf - r) 5 r 2 }) , 



and hence, by Jensen's inequality 



n\\S\\m] < y/E[\\Sr w ] = O (max{r 7 / 2 , (rf - rf'h}) , 
Collecting terms, we see that 

dw(W,Zx) < ||A- 1 || op E[||i?|| 2 ] + -^||A- 1 || op ||S- 1 / 2 || op E[||5|| H s] 

V Z7T 



2 

— O (max{r 3 , (rf - r) 5/2 y/r}) 



(n-l)(d-r + l) 

+ "7^7 1U / , 1W 2 ° Hax{r 7 / 2 , (rf - rf/ 2 r}) 

\/27r m — 1 « — r + 1 d ' 



2^(n-l)(rf-r + l) 3 / 2 

r 3 r 7 / 2 (d-rf/ 2 ^ (d_ r )5/2 r 

d-r+1' (d-r+1) 3 / 2 ' d-r+1 ' (d-r+1) 3 / 2 

n 



max 



Proceeding as in the unitary case and treating the cases that r < d — r and d — r < r 
separately, one obtains that the last expression is of order 

1 f r 7/2 
This concludes the proof of Theorem 14.11 
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5. The symplectic group 

Let M = M n be distributed according to Haar measure on K n = USp 2n and let d e N, r = 
l,...,d. Consider the random vector W := W(d, r, n) := (/ d _ r+1 (M), f d _ r+2 (M), f d (M)f 
where 



f r .USp 2n ^R, J) 



Pj, j odd 
Pj + 1, j even. 

Let Z = (Zd-r+i, ■ ■ ■ , Z d ) T denote an r-dimensional real standard normal random vector, 
£ := diag(d — r + 1, d — r + 2, . . . , d), and write Z% '■— T}/ 2 Z. 
The objective of this section is to prove the following 

Theorem 5.1. 

If n > 2d, the Wasserstein distance between W and Z^ is again of the same order as in the 
unitary case, namely, as given in ([6]). 

For the proof, we use again exchangeable pairs (W, W t )t>o such that 

W t = (f d . r+1 (M t ), f d - r+2 (M t ), f d (M t )) T , 

and apply Proposition 11.11 The verification of condition (i) involves the following analog of 
Lemma 14.21 above. 

Lemma 5.2. For all j — d — r + 1, . . . , d, 

E[W t>j - W \M] = nfAMt) - fj{M)\M] = t ■ (- ^±^l fj (M) + R 3 + 0(f)) , 



where 



■ 3-1 ■ 3-1 

R i = -nZ^Pi,j-i( M ) ~ 7;/^P2i-j( M ) ifj is odd, 



2 ^ L ,J v ' 2 
i=i i=i 

Rj = 2 ~ ~ Y,Pi,i-i(M) - \ £>-i(M) ifj is even. 

i=i i=i 

Proof. First observe that always fj(M t ) — fj{M) = pj(M t ) — pj(M), no matter what the 
parity of j is. By Lemmas 11.51 and 12.31 

Elp^Mt) - Pj (M)\M] = t(Ap 3 )(M) + 0(t 2 ) 



i=i i=i 



which is 

3-1 ■ 3-1 



(2n + l)j 

/ ! 

if j is odd and which is 
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<(-^±%(M) + (3z±M -i'jZp^(M) + °<«) 

if j is even. This proves the lemma. □ 
From Lemma [5.21 we conclude that 

-E[W t -W\M}^ -AW + R a.s. and in L^P) , 

where A = diag ^2n±Di , j = d — r + 1, . . . ,d) and R = (Rd- r +i, ■ ■ ■ , Rd) T ■ Thus, condition 

(i) of Prop. II. H is satisfied. The validity of condition (ii) follows from the next lemma, whose 
proof only differs from that of Lemma 14.31 in that it makes use of Lemma 12.31 in the place of 
Lemma 12.21 

Lemma 5.3. For all j,k = d - r + 1, . . . ,d, 

E[( Pj (M t )- Pj (M))( Pk (M t )-p k (M))\M] = t(jkp 3 . k (M)-jkp J+k (M)) + 0(t 2 ) . 

Observing that for all j — d — r + 1, . . . , d we have fj(M t ) — fj(M) = pj(M t ) — Pj{M) and 
using Lemma [5.31 we can now easily compute 

^E[{W tJ - Wi){W t , k - W k )\M] = ^E[(fj{M t ) - fj(M))(f k (M t ) - f k (M))\M] 
= jEl(p J (M t )-p J (M))(p k (M t )-p k (M))\M}=jkp ] - k (M)-3 

^ jkpj_ k (M) - jkp j+k (M) a.s. and in L X (P) , 

for all d — r + 1 < j, k < d. Noting that for j = k the last expression is 2j 2 n — j 2 p 2 j(M) and 
that 2AS = diag((2n + l)j 2 , j — d — r + 1, . . . , d), we see that condition (ii) of Prop. II. H is 
satisfied with the matrix S = (Sj tk )j, k= d- r +i,...,d given by 



S 



-f(l+p 2j (M)), j = k 

jk Pj _ k (M)-jkp J+k (M), j^k. 



The validity of condition (Hi') of Prop. 11.11 is based on the following lemma, which can be 
proven in the same way as its analog in Section HI 

Lemma 5.4. If n > 2d, for all j = 1, . . . , d there holds 

(i) E [(Wtj - Wj) 2 } = tf(2n + 1) + 0(t 2 ). 

(ii) E[(W tJ -W ] f} = 0(t 2 ). 

So we obtain from Prop. 11.11 that 

d w (^^)<l|A~1o P rE[||i2|| 2 ] + -^||E- 1 / 2 || op E[||5|| H s]) • 
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Again, it is easy to see that ||A _1 ||o P = (2n+1)( 2 d „r+i) and ll s ~ 1/2 ||o P = Vd l r+1 ■ Since we 
can show in a similar way as in Section H] that E[||i?||2] = O (max{r 3 , (d — r) 5 / 2 y / r}) and 
E[||S'||hs] — O (max{r 7//2 , {d — r) 5//2 r}), we may conclude the proof of Theorem 15. II as before. 
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